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Internet Access to Large Government Data Archives: The Direct Edgar Access 

System 

2. Project Summary 

This project will develop and demonstrate ways to post large government 
data arcliives on the Internet for access by researchers and the general public. 
The two-year project will use the Securities and Exchange Commission (SEC) 
EDGAR Database as a test database and will develop and implement 
techniques to provide low-cost management and dissemination of these data 
archives. The goal of the project is to develop techniques that can be used 
directly by government agencies and other information providers, giving 
these organizations the tools and knowledge to allow them to place 
government information on the Internet for direct access by researchers and 
the general public. Secondary goals of the project are to make key financial 
archives from the SEC available to researchers and the general public who 
would not otherwise have access to the information. 

The Direct EDGAR Access System will be under the direction of principal 
investigator Dr. Ajit Kambil of the New York University Stern School of 
Business. The project starts with the EDGAR Dissemination Service, a raw 
data feed intended for information wholesalers and retailers to process. The 
project will develop techniques for automatic reformatting of this 
information into file systems appropriate for dissemination on the Internet 
and will maintain an Internet-based server capable of supporting 1.0 Mbps of 
application level data throughput. 

The project will then work with public domain tools to adapt them to the 
specific needs of large government data archives. Custom WAIS 
vocabularies, support for multicast based data dissemination, and semi- 
automatic structuring of hypertext and Gopher links will all be investigated. 
The project will obtain advice from researchers in the Finance department of 
the Stern School to identify ways to provide semi-automatic value-added 
cataloging and data abstraction. 

The data conversion, access and management techniques learned in this 
project will be fully detailed in project reports and all software and techniques 
will be placed in the public domain. 
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4 Project Description 

4.1. Access to Government Data 

Federal and state government agencies are all tackling the very difficult issue 
of making key information available to the public over computer-based 
information systems. This project develops tools and techniques that 
enhance the ability of government agencies to make their information 
available to the public over the Internet computer network. 

The Internet computer network is a key U.S. priority, receiving substantial 
backing from agencies ranging from the National Science Foundation to 
NASA to the Department of Defense. This infrastructure is widely available 
in university, research, and government institutions. The Internet and 
related initiatives are a key part of the emerging National Research and 
Education Network. 

While the network is already in place, there is considerable work to be done 
to make large data archives available over this infrastructure. This project 
investigates techniques by which the government can make large archives 
available to the public over the Internet. A key goal of the project is to 
provide an example for agencies from which they can learn to directly 
provide information to the Internet in a cost-effective manner. 

4.2. The SECs Edgar Database 

EDGAR is the Security and Exchange Commission's Electronic Data 
Gathering, Analysis and Retrieval System. The EDGAR Dissemination 
Service is operated by Mead Data Central, Inc. as the SECs dissemination 
subcontractor. Under the EDGAR rules and phase-in schedule, financial 
firms must begin filing key SEC forms electronically. That information 
becomes available through EDGAR as either a live broadcast- service dr a 
delay ed-dissemination tape service. 

Under the EDGAR schedule, the initial project began July 15, 1992 with 450 
voluntary filers. Beginning April 26, 1993, additional groups of filers joined 
the program as follows: 


April 26, 1993 

Group 01, 

450 Filers 

July 19, 1993 

Group 02 

1203 Filers 

October 4, 1993 

Group 03 

749 Filers 

December 6, 1993 

Group 04 

1000 Filers 



By December 1993, there will be a total of 3,402 filers. At that point, a six 
month test and evaluation period will be conducted. If successful, final 
EDGAR rules will be adopted and all other SEC filers will be brought on-line 
according to the following schedule: 


August, 1994, 

Group 05, 

2299 Filers 

November, 1994 

Group 06, 

2297 Filers 

May, 1995, 

Group 07 

1496 Filers 

August, 1994 

Group 08, 

2299 Filers 

November 1995 

Group 09 

2299 Filers 

May,1996 

Group 10 

808 Filers 


Mead Data provides several levels of dissemination of data. The Level 1 
Broadcast Service is an on-line stream of data using X.25 or TCP/IP protocols. 
It requires subscribers to maintain seven 56 kbps lines (or presumably a single 
T1 1.544 Mbps line) into the Dissemination Service mainframe. Lower- 
volume subsets of the data as well as overnight courier services are available. 


The data in the EDGAR system consists of the following key subsets: 


Subset 1 

Periodic filings by companies on the New York Stock Exchange 
including forms 10-K„ 10-Q„ 8-K„ 10 and all Definitive Proxies. 

Subset 2" 

Periodic filings by companies on the American Stock Exchange 
including all forms in Subset 1. 

Subset 3 

Periodic filings by companies traded over-the-counter 
including all forms in Subset 1. 

Subset 4 

Filings made pursuant to the Williams Act including 
Schedules 12-D„ 14D„ 14D-9,, 13-G„ 13-El,, 13-E3,, and 13-E4. 

Subset 5 

Annual and quarterly reports of all companies pursuant to the 
Securities and Exchange Act of 1934. 

Subset 6 

Securities Act of 1933 Registration Statements including S 
series 

Subset 7 

Stock ownership reports including forms 3„ 4„ 144 

Subset 8 

Periodic reports filed under the Investment Company Act of 
1940 including Forms NSAR-A,, NSAR-B,, and NSAR-U. 


The data itself is formatted as a standard series of files with a Control Block 
header and a series of SGML-tagged data fields. The header indicates the form 
type, the company name, and an indication if the filing is a submission, 
correction, deletion, or other form of document transfer. The data itself 
consists of text files formatted with tags and data values. 

4.3.Data Conversion and Maintenance 




















The project will establish a Level 1 Broadcast service feed to the 
Dissemination Service mainframe. Initial software development will write 
data parsers that translate the incoming data stream into a standard set of 
files. The initial data conversion will use standard tools such as PERL and the 
C Programming Language and will result in a standard file system structure 
appropriate to this data set. 

The data will then be maintained on a Sparcstation server at Internet 
Multicasting Service with a large disk farm. This machine will be connected 
to the Internet and will be configured to support an application level data 
stream of 1.0 Mbps, equivalent to approximately 50 simultaneous anonymous 
FTP sessions. 

The data conversion and maintenance part of the project will be divided into 
two phases: 

• Establishment of the data feed 

• Ongoing processing of the data during the project 

In the initial phase, lasting for the first six months of the project, a dedicated 
T1 line will be established to the EDGAR dissemination service using the 
TCP/IP protocol option. This link will terminate directly in the EDGAR 
dissemination mainframe. 

The other end of the link will terminate on a DSU/CSU, which will cormect 
to a router on a dedicated LAN at the Internet Multicasting Service. That 
LAN will also house a Sparcstation which will provide the main processing 
power for the project. The Sparcstation will be configured with an 8 mm 
backup device and 20 gigabytes of disk space. ^ 

During the irutial project phase, we will establish a procedure for incoming 
data from the Edgar dissemination system to be spooled on the Sparcstation 
disk farm. The initial spooling will be in the raw format defined by the 
dissemination service. 

Using PERL scripts, C programs, and other Unix-based processing tools, the 
project will then begin the initial reformatting of the data. The exact format 
of the file system and the data will be determined by the project team, 
attempting to balance processing requirements, ease of use, completeness, and 
other conflicting factors. 


ISee letter of commitment from INTERNET MULTICASTING SERVICE at 
the end of this project description. 


We anticipate the file structure to consist of a series of directories, one for 
each type of form. In that directory would be a series of files, each coded by 
company name, date of filing, and type of filing (e.g., initial filing, 
amendment, etc...). 

A series of indices and symbolic links will be automatically generated to allow 
direct access to all forms a particular company has filed, or by a particular date, 
or other cross-sections of the data. 

In addition to the initial restructuring of the data, the project will also 
establish verification procedures to determine that data integrity has been 
maintained. The procedures will include statistically valid samples of the 
data on the Sparcstation which will be compared with source documents, as 
well as an analysis of the resulting file structure against the initial data feed. 

Once the initial procedures are put in place, the system will stabilize and will 
continue the processing of information throughout the life of the project. 

This base data will form the platform for public access and will be 
supplemented throughout the project by additional data generated by 
students at New York University. 

4.4. Data Analysis and Access 

The project team will start with the basic data conversion then apply the skills 
of information systems, finance and accounting faculty and students at the 
New York University Stern School of Business. Students will interview 
faculty in management, finance and accounting departments using structured 
techniques to identify specific data access and search requirements to facilitate 
management research as well as classroom, educational and public use of the 
EDGAR data. The student groups under the direction of the principal 
investigator will work with the base data to apply modern resource discovery 
tools and will examine techniques for providing value-added information on 
the base data. 

4.4.1. Network Based Tools 

The project will develop and modify common public domain tools such as 
WAIS, World Wide Web, Gopher, Veronica, Viola, Mosaic and others as 
applicable to work within the specific constraints of large government 
databases. The project will evaluate techniques such as custom WAIS 
vocabularies to aid in searching. 

In addition to the standard resource discovery tools, the project will use 
anonymous FTP, e-mail based information servers, and other low-tech 
methods to make the data available. The project will investigate how to 


make a large, complex mass of information such as the EDGAR database 
easily accessible by low-bandwidth users with tools such as basic e-mail. 

The project will begin by immediately establishing access to information with 
anonymous FTP and a public domain e-mail-based server. This basic, raw 
access will allow immediate access to Edgar information for researchers 
willing to work with the data in raw format. 

The next level of value-added service to be provided will index the 
information for inclusion in a WAIS database. The commercial version of 
WAIS will be used because the indices are more efficient and the server code 
has higher performance. However, any WAIS client will be able to access the 
information. 

Access by Gopher and hypertext links in the World Wide Web system will 
take a little longer to establish since these systems establish structure on the 
data. We anticipate a rough version of a Gopher server to be immediately 
established, but a final version of the menus will be developed during the 
data administration and aggregation portions of the project. 

4.4.2. Data Administration and Aggregation 

In addition to providing and modifying common tools for access to the data, 
the project team will investigate techniques for providing value-added 
information in a semi-automatic fashion. Data-specific summaries such as 
indices and abstracts can be automatically generated. Multiple iterations of 
data (original filings plus additional corrections) can be folded into a single 
combined piece of information. Aggregations of data can be provided on an 
industry, market, or other aggregation basis. 

Initially, the program will start by determirung the proper way to structure 
the information to allow simple, menu-based navigation of the Edgar data. 
Combined with the WAIS free-text search capabilities, a highly functional 
interface to the data can be established. 

A more sophisticated interface into the information can be provided using 
hypertext links, linking one portion of the database to another portion (or to 
other databases on the Internet). The project will determine which pieces of 
information in the database can be related to other pieces, then link those 
pieces together. 

In this stage of the project, students from finance and information systems 
will work closely together to combine the semantic content of the data with 
the advanced tools. The result will be Gopher, WWW, WAIS, and other 
interfaces into the information which will provide the general public and 
researchers with a powerful, flexible interface to the Edgar data. 



4.4.3. Advanced Topics 


In addition to basic research topics detailed above, the project will investigate 
several novel distribution techniques. First, the project will use IP 
multicasting to move the database around the world as very low volume (7-9 
kbps) data streams. Software for receiving the data and setting complex traps 
for specific pieces of information will be developed and deployed and this 
software will be integrated into the Session Directory (SD) software running 
on the Multicast Backbone (MBONE). 

4.5. Timelines 

The project will begin on October 1, 1993 and will run for two years. At the 
close of the second year, the National Science Foundation may elect to 
continue the project with the specific aim of providing a direct transition of 
the system into the SEC information systems division. Such transition is 
dependent on SEC evaluation of the technical feasibility of operating a 
database server on the Internet. 

In the first six months, the basic data structure will be established and the 
server will become operational. The second six months of the project will 
provide access to the data on the Internet using basic interfaces such as 
Gopher, anonymous FTP, and e-mail. Basic WAIS access will be provided 
during the stage. 

In the second year of the project, the advanced interfaces based on Gopher, 
WWW, and WAIS will be established and the multicast based deployment of 
the data will be initially deployed. 

Throughout the project, frequent briefings will be provided to interested 
parties, including officials of the Securities and Exchange Commission. Such 
briefings, supplemented by regular project reports, will allow the SEC to 
monitor the progress of the project and to evaluate the potential for 
establishing a similar service within the SEC. 

4.6. Project Reporting 

Grantee shall provide periodic project briefings to interested parties, 
including government agencies, researchers in academic institutions, 
librarians, and others as appropriate. These briefings will detail the status of 
the project and identify key issues. 

Principal Investigator and project participants shall provide full information 
to interested government agencies including the Securities and Exchange 


Commission (SEC) with a view of providing interested agencies with the 
tools to perform similar functions internally. 

4.6.1. Oversight Committee 

New York University and the Principal Investigator shall establish a project 
oversight committee to provide advice on project progress and plans. The 
committee shall be chaired by a professor from the Stern School of Business 
and will include representatives from key interested parties, including the 
academic and research community, the financial services industry, the retail 
information industry, the Network Information Center (InterNIC) and others 
as appropriate. 

The Oversight Committee shall meet twice annually. Principal Investigator 
and project participants shall present an overview of project progress and the 
Oversight Committee will make recommendations. The results of these 
meetings shall be included in the next reports to the NSF Program Director 
including a summary of items discussed and action items to be taken. 

4.6.2. Timely Notification of Significant Problems 

The Grantee will inform the NSF Program Official (either by e-mail or in 
writing) in a timely manner of any significant problems or events that could 
affect the overall schedule or progress in the program. 

4.6.3. Verbal Reports and Liaison 

The Grantee will meet on an informal basis, as necessary or requested, with 
the NSF Program Director to review progress to date and to exchange views, 
ideas, and information concerning the program. 

When requested by the NSF Program Director, Grantee will arrange to have 
its subawardees in attendance at meetings which deal with their areas of 
activity. In addition, at the request of the NSF Program Director, the Grantee 
will arrange on-site meetings for the Program Officer, other Federal staff 
and/or representatives of the world-wide Internet community and the 
Grantee's professional personnel, and/or those of its subawardees. 

4.6.4. Monthly Letter Progress Reports 

Monthly letter progress reports will be submitted electronically to the NSF 
Program Official and NSF Administrative Official at the address shown on 
the cover page. These monthly letter progress reports will be submitted in 
such detail and format as required by the Foundation's Program Director and 
will contain statistical and narrative information on the performance of the 
Grantee during the preceding month. 


4.6.5. Quarterly Status Report 


Grantee will prepare and furnish electronically and in four hard (4) copies 
quarterly letter status reports; the first quarterly status report will be for the 
period from October 1, 1993, through December 31, 1993. These reports will 
show the status of all major events and summaries and major work 
performed during the quarter, including technical status, accomplishments, 
problems, collaboration activities, changes in future plans, and any pending 
requests for NSF approval and will be fully reconciled with the information, 
goals and projections contained in the Annual Report and Program Plan. 

The report will also include a summary of award expenditures and line 
charges both cumulative and for the current quarter. 

The report will be prepared on a quarterly basis and shall be submitted within 
thirty (30) days after the reporting period ends. No quarterly report will be 
submitted for the quarter in which the Annual reports are submitted, but. 
Awardee will insure that any germane information for the quarter not 
contained in the Annual Report (i.e, list of pending requests for NSF 
approval) are submitted by separate letter. 


4.6.6. Final Report 

The Grantee will submit electronically and in three (3) hard copies a final 
report to NSF at the conclusion of the Cooperative Agreement. The final 
report will contain a description of all work performed and problems 
encountered (and if requested a copy and documentation of any and all 
software and data generated) in such form and sufficient detail as to permit 
replication of the work by a reasonably knowledgeable party or organization. 


4.7. Project Benefits 

The project will provide the following primary benefits: 

• Demonstration of how to convert large government data bases for 
public and research access through the Internet. 

• Demonstration of new techniques for data distribution in the 
Internet. 


• Modified resource discovery tools and management techniques for 
converting government archives for public and research data access. 


• Improved public, student and faculty access to information resources 
for research and teaching in finance, accounting and management. 

• Provision of a large data archive for information systems and 
computer science research on knowledge discovery in temporal databases. 

Finally, the NYU Stern School of Business provides an especially useful 
infrastructure and resource base for implementing this project. The 
Information Systems Department and the Finance departments of the Stern 
School are both nationally ranked as among the top ten nationwide in terms 
of research and scholarship. The school, faculty and students have excellent 
relations with, and are very knowledgeable about the financial services 
industry. The school has excellent Internet access and computing 
infrastructure, including a backbone FDDI ring and ethernet connections to 
desktop computers in the Stern School of Business, Management Education 
Center. These resources will be utilized in this project. 
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June 30, 1993 


Professor Ajit Kambil 
Department of Information Systems 
Leonard N. Stern School of Business 
New York University 

Management Education Center, Suite 9-170 
New York, NY 10012-1126 

Dear Dr. Kambil: 

I am writing at your request to propose a subcontract for your National Science 
Foundation proposal on the new "Internet Direct EDGAR Access System." The 
Internet Multicasting Service would be pleased to provide core data processing 
functions and to work with you and your students as an Integral part of this ex- 
citing research project. 

Specifically, we propose a subcontract for the amounts of $168,800 for FY94 
and $83,000 for FY95. Under this contract, we would procure, operate, and 
support a Sparcstation system with 20 Gigabytes of disk space and Internet con- 
nectivity capable of serving data to the Internet at a sustained throughput rate 
of 1.0 Mbps at the application layer. This is roughly the equivalent of 50 simul- 
taneous File Transfer Protocol (FTP) sessions. 

In addition, we will actively work with you and other project participants to se- 
cure funding for the EDGAR Dissemination Data Feed. Once the data feed is es- 
tablished, we will coordinate the technical establishment of that data feed and 
will process the incoming data in a manner suitable for access by the Internet 
community. We will perform such tasks for a two-year period, as per your pro- 
posal to the National Science Foundation. 

For purposes of the subcontract, I will be considered a "key personnel" and you 
will have the right to approve any changes of key personnel. In addition, I 
will devote additional staff of the equivalent of 1.0 FTE as needed to perform 
data processing and support tasks. We agree to participate in project briefings, 
to contribute to project reports, and to otherwise work with you and your stu- 
dents to support this project under the scope of work as outlined in your pro- 
posal to the National Science Foundation. 

A PRODUCTION OF THE INTERNET MULTICASTING SERVICE 


SUITE 1155 • THE NATIONAL PRESS BUILDING • WASHINGTON, D.C. 20045 
TEL: (202) 628-2044 • FAX: (202) 628-2042 • EMAIL: CarI@Radio.COM 


INTERNET 
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Professor Ajit Kambil, Page 2 

The Internet Multicasting Service is a Delaware Corporation with offices in 
Washington, D.C. We have flied for tax-exempt status with the IRS and D.C. 
authorities and our by-laws are drafted as a public benefit corporation. We will 
be contributing cost sharing to this project in the form of personnel ($20,000 
per year), subsidization of Internet connectivity ($25,000 per year), and addi- 
tional equipment needed for the project with a value of $100,000. 

Work on this subcontract can commence upon approval by the National Sci- 
ence Foundation of the project, our successful establishment of funding for an 
EDGAR Dissemination Service data feed from the private sector or other 
sources, and the didiursement of funds by NYU to enable us to procure the 
base hardware system needed for the project. 

I look forward to a long and fiuitful collaboration with you and the Stern 
School of Business. 


Sincerely, 



Carl Malamud 
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Office: [212]-998-0843 
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Research 

My research is focused on three related themes: the effects and use of information 
technology to transform industrial structure and competition; processes and methods for 
alignment of business and information technology strategies; and the effective design and 
use of telecommunications and collaborative technologies as a business resource. My 
current projects include: an extended study of the impact of electronic tax filing on structure 
and competition in tax preparation and related markets, an assessment of outsourcing 
practices and its use to align business and infonnation systems strategies, and a study of 
techniques for the effective use of the Internet as an educational and b^usiness resource. 

Recent Articles: 

"Information Technology and Vertical Integration: Evidence from the Manufacturing 
Sector," in M.G. Calvert and S. Wildman (eds). Electronic Service Networks: A Business 
and Policy Challenge, Praeger Publishing Co., December 1990. 

"Strategies for Electronic Integration: Lessons from Electronic Filing of Income Taxes," 
Sloan Management Review, Winter 1991, with N. Venkatraman. 

"Electronic Integration: A Critical Review and Network Extensions," in R.J Strong etal 
(eds) Scientific Research on EDI, Samsom Publishing, 1992 
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Ajit Kambil 

Assistant Professor 
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Leonard N. Stern School of Business 

New York University 
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"Sti’ategic Management of Information Technology Investments: An Options Perspective" 
in R.D Banker, RJ Kauffman, M.A Mahmood (eds). Strategic Information Technology 
Management: Perspectives on Organizational Growth and Competitive Advantage, Idea 
Group Publishing, 1993 

Collaborators 

My dissertation committee was chaired by Jack Rockart and included N. Venkatraman, 
John Henderson, and Benn Konsynski 

My collaborators also include Jim Short, Henry Lucas, Jon Turner, Ken Laudon. I am 
supervising research with three doctoral students; Luis Martins, Michael Fish and Alan 
Zimmerman. 



Carl Malamud 
President 

Internet Multicasting Sei'vice 
Suite 1155 

The National Press Building 
Washington D.C 20045 


Carl Malamud is President of the Internet Multicasting Service, an organization that is 
providing large-scale dissemination of data onto the Internet. The Internet Multicasting 
Service is a public-interest corporation with headquarters in the National Press Building in 
Washington, D.C. 

Malamud is the author of seven professional reference texts in the fields of networking and 
database systems and is active in the Internet Engineering Task Force. He was project 
coordinator for conversion and posting of technical standards developed by the 
International Telecommunication Union, a system that successfully used the Internet for 
dissemination of large data archives. The ITU has since successfully taken the function 
over and is providing distribution of key technical standai'ds over the Internet. 

Malamud has seiwed as a consultant to many government groups including Argonne 
National Laboratory, Lawrence Livemiore National Laboratory, the Joint Chiefs of Staff, 
the Board of Governors of the Federal Reserve System, the Office of the Secretary for 
Defense, and the Defense Technical Information Center. He has served as a consultant to 
many firms in the computer industiy, including Sun Microsystems, Control Data 
Corporation, Hewlett-Packard, Xerox, Digital Equipment Corporation, and many others. 
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7.3 Budget Explanation 


Personnel: One month summer salary for the first year and 1/3 academic year 
salary for the second year is requested by the principal investigator. Support 
for one principal graduate assistant on a full-time basis and two other 
graduate students on a part time basis (0.5 FTE) is requested for two academic 
years and the summer. 

Wage increases are budgeted at 5% per year. The current rate of fringe benefits 
is 29%, increasing by 1% per year. Graduate student tuition is budgeted at 51% 
of salary. 

Equipment: 

The Principal Investigator has a Mac Quadra 700 as his primary machine 
provided by the academic institution. We request an IBM compatible Intel 
486, machine with 200 MB of disk, 20 MB of RAM, with a 16 " color monitor. 
Given the dominance of the Windows and DOS environments, this will be 
the primary platform for developing and testing Window's compatible user 
interfaces to EDGAR data. A 1 Gigabyte SCSI hard disk is required for local 
storage and a hublet is also required to enable multiple machines to connect 
to the network. 

In addition a licence for WAIS server software for a New York University 
Sparcstation is required to provide a local WAIS server site for testing WAIS 
vocabularies on sample data from Edgar. 

Travel: 

Funds are requested for the Principal Investigator and students to visit the 
SEC and the Internet Multicasting Service in Washington D.C for project 
coordination and work at least three times a year. In addition the Principal 
Investigator requests travel funds to the International Conference on 
Information Systems (Florida '93 and Vancouver '94) and INET and National 
Net Conferences to disseminate research results and reports. 

Other Direct Costs 

We have included a small sum for printing to primarily support notification 
of business school faculty of the availability of new data on the Internet for 
research as well as dissemination of preliminary documentation for access 
and research results. 

Money was also allocated for a project oversight committee to visit and 
advise the principal investigators and their team on the progress of the 



project. This money will enable up to 5 committee members to visit New 
York from out of state twice a year for two days. 

Intenet Multicasting Service - Fixed Price Contract 

Internet Multicasting Service will provide data processing and maintain the 
Sparcstation and disk farm to support the project. In addition they will 
provide initial conversion and formatting of the data, maintain a server 
capable of 1 Mbps applications level throughput, and implement multicast 
based data distribution. They will also provide technical assistance for the 
project and participate in project meetings, reporting and other activities as 
per the letter of commitment attached to this proposal. 


Other Costs 

Total NSF costs do not include cost-sharing contributions. New York 
University will cost share in accordance with NSF policy and contribute 
graduate student support valued at $40,000. The fixed price subcontract from 
Internet Multicasting Service includes cost sharing of Internet access facilities, 
hardware, and personnel valued at $55,000. Additional cost sharing of 
$138,000 per year will be contributed by other organizations for the Edgar 
Dissemination Service data feed. 


8. Current and Pending Support 
Ajit Kambil 


None 


